A UNIVERSALITY RESULT FOR THE GLOBAL FLUCTUATIONS OF THE 
EIGENVECTORS OF WIGNER MATRICES 



FLORENT BENAYCH-GEORGES 



Abstract. We prove that for [uij]"i = i the eigenvectors matrix of a Wigner matrix, under 
some moments conditions, the bivariate random process 

/ \ 

1 < z < n.s , 

V 1 ^'^ 11 ' / ( s ,t)e[o,i] 2 

converges in distribution to a bivariate Brownian bridge. This result has already been proved 
for GOE and GUE matrices. It is conjectured here that the necessary and sufficient condition, 
for the result to be true for a general Wigner matrix, is the matching of the moments of orders 
1, 2 and 4 of the entries of the Wigner with the ones of a GOE or GUE matrix. Surprisingly, 
the third moment of the entries of the Wigner matrix has no influence on the limit distribution. 



1. Introduction 

It is well known that the matrix U n = [ujj]™ J=1 whose columns are the eigenvectors of a 
GOE or GUE matrix X n can be chosen to be distributed according to the Haar measure on 
the orthogonal or unitary group. As a consequence, much can be said about the Uy's: their 
joint moments can be computed via the so-called Weingarten calculus developed in OHO], any 
finite (or not too large) set of itjj's can be approximated, as n — > oo, by independent Gaussian 
variables (see [181 E]) and the global asymptotic fluctuations of the | governed by a 

theorem of Donati-Martin and Rouault, who proved in [11] that as n — > oo, the bivariate cadlag 
process 

/ \ 

^v=vf E (Kii 2 -v») 

l<i<ns, 

V l<j<nt I ( Sl t)g[ ,l]2 

(where /5 = 1 in the real case and f3 = 2 in the complex case) converges in distribution, for the 
Skorokhod topology, to the bivariate Brownian bridge, i.e. the centered continuous Gaussian 
process (B Stt )( s ,t)e[o,i] 2 with covariance 

(1) E[B s , t B s , tt ,] = (min{s, s'} - ss')(mm{t, t'} - tt'). 

A natural question is the following: 
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What can be said beyond the Gaussian case, when the entries of the Wigner 
matrix X n are general random variables ? 

For a general Wigner matrijQ the exact distribution of the matrix U n cannot be computed 
and few works had been devoted to this subject until quite recently. One of the reasons is that 
while the eigenvalues of an Hermitian matrix admit variational characterizations as extremums of 
certain functions, the eigenvectors can be characterized as the argmax of these functions, hence 
are more sensitive to perturbations of the entries of the matrix. However, in the last three years, 
the eigenvectors of general Wigner matrices have been the object of a growing interest, due in 
part to some relations with the universality conjecture for the eigenvalues. In several papers 
(see, among others, [HI [151 EI])> a delocalization property was shown for the eigenvectors of 
random matrices. More recently, Knowles and Yin in [20J and Tao and Vu in [26] proved that if 
the first four moments of the atom distributions of X n coincide with the ones of a GOE or GUE 
matrix, then under some tail assumptions on these distributions, the Mjj's can be approximated 
by independent Gaussian variables as long as we only consider a finite (or not too large) set of 



In this paper, we consider the global behavior of the |ujd's. By global behavior, we mean that 
we consider functionals of the lull's that involve all of them, at the difference of the works of 
Knowles and Yin in 1201 and Tao and Vu in |26|. Our work is in the same vein as Silverstein's 



paper [22] or Bai and Pan's paper [2]. We prove (Theorem 2.3) that for Wigner matrices whose 
entries have moments of all orders, the process (-B™t)( s ,t)e[o,i] 2 has a limit in a weaker sense 
than for the Skorokhod topology and that this weak limit is the bivariate Brownian bridge if 
and only if the off-diagonal entries of the matrix have the same fourth moment as the GOE or 
GUE matrix (quite surprisingly, no hypothesis on the third moment is necessary). Under some 
additional hypotheses on the atom distributions (more coinciding moments and continuity), we 



prove the convergence for the Skorokhod topology (Theorem 2.6) 



This result was conjectured by Djalil Chafa'i, who also conjectures the same kind of universality 
for unitary matrices appearing in other standard decompositions, such as the singular values 
decomposition or the Housholder decomposition of matrices with no symmetry, as long as the 
matrix considered has i.i.d. entries with first moments agreeing with the ones of Gaussian 
variables. It would also be interesting to consider the same type of question in the context 
of band matrices, connecting this problem with the so-called Anderson conjecture (see e.g. the 
works of Erdos and Knowles |12[ll3j. of Schenker [21] or of Sodin [23], or, for a short introduction, 
the blog note by Chafa'i [7]). 

The paper is organized as follows. The main results are stated in Section [2j where we also 
make some comments on their hypotheses ; outlines of the proofs and the formal proofs are 
given in Section [3] ; and Section [4] is devoted to the definitions of the functional spaces and their 
topologies and to the proofs of several technical results needed in Section [3] 

Acknowledgements: It is a pleasure for the author to thank Djalil Chafa'i for having generously 
pointed out this problem to him and for the nice discussions we had about it. We also would 
like to thank Alice Guionnet, for her availability and her precious advices, and Terry Tao, who 
kindly and patiently answered several naive questions asked by the author on his blog. At last, 
we would like to thank the anonymous referee who pointed a mistake in the proof of Proposition 



2.10 and brought the paper [2] to the attention of the author. 



Wigner matrix is a real symmetric or Hermitian random matrix with independent, centered entries whose 
variance is one. Its atom distributions are the distributions of its entries. 
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2. Main results 

For each n, let us consider a real symmetric or Hermitian random matrix 

X — J_f T (n) l n 



(n) 

For notational brevity, x) • will be denoted by X; * . 

Let us denote by Ai < • • • < A n the eigenvalues of X n and consider an orthogonal or unitary 
matrix U n = [ui,j]fj=i such that 

X n = U n diag(Ai, ...,X n )U*. 
Note that U n is not uniquely defined. However, one can choose it in any measurable way. 
We define the bivariate cadlag process 







B lf-=\j^ £ (Kil 2 - 1 /") 



i l<i<ns, i 

\ l<j<nt } 

where (3 = 1 in the real case and (5 = 2 in the complex case. 



(M)e[o,i] 2 



Assumption 2.1. For eac/i n, i/ie random variables Xij 's are independent (up to the symmetry), 
with the same distribution on the diagonal and the same distribution above the diagonal. 

Assumption 2.2. For each k>l, sup n E[|xi 5 i| fc + |a?x,2 1 ] < °°- Moreover, 

(2) E[x x ,i] =E[xi, 2 ] =0, E[|x li2 | 2 ] = l 

and E[|xi 2I ] has a limit as n — >■ 00. 

The bivariate Brownian bridge has been defined in the introduction and the definitions of the 



functional spaces and their topologies can be found in Section 4.1 



Theorem 2.3. Suppose that Assumptions \2. l\ and \2.2\ are satisfied. Then the sequence 

(distribution(S n )) n >i 

has a unique possible accumulation point supported by C([0, l] 2 ). This accumulation point 
is the distribution of a centered Gaussian process which depends on the distributions of the 
Xij 's only through lim^^oo Efjxi^l 4 ], and which is the bivariate Brownian bridge if and only if 
W+ 0O E[|a;i i 2| 4 ] =4-/3. 

More precisions about the way the unique possible accumulation point depends on the fourth 



moment of the entries are given in Remark 2.8 



To get a stronger statement where the convergence in distribution to the bivariate Brownian 
bridge is actually stated, one needs stronger hypotheses. 

Assumption 2.4. The distributions of the entries of X n are absolutely continuous with respect 
to the Lebesgue measure. 

The following hypothesis depends on an integer m > 2. 
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Assumption 2.5. For each k > 1, sup n E[|xi 5 i| fc + |xi i2 | fc ] < oo. Moreover, for each r,s>0, 

(3) r + s < m - 2 => E[K(x 1 , 1 ) r 9(x 1 , 1 ) s ] = E[3?( gijl ) r Qf( ffljl ) s ] 

and 



(4) 



r + s < m 



E[K(x 1)2 ) r 3r(x li2 ) s ] = E[SR( 5 i, 2 ) r 3( 5 i i2 ) s ]. 



where the gij 's are the entries of a standard GOE or GUE matrix. 



Theorem 2.6. Suppose that Assumptions 2.1 and 2.4 are satisfied, as well as Assumption 2.5 
for m = 12. Then, as n — > oo, the bivariate process B n converges in distribution, for the 
Skorokhod topology in D([0, l] 2 ), to the bivariate Brownian bridge. 



Remark 2.7. Weakening of the assumptions. As explained above, the di strib utions of the 

below (which 



entries x 



» 



are allowed to depend on n. In order to remove Assumption 



we did not manage to do yet), it mights be useful to weaken Assumptions 2.2 and 2.5 in the 



following way: one can easily see that the proofs of Theorems 2.3 and 2.6 and Proposition 2.10 



2.4 



still work if, in Equations @, ^ and Q, one replaces the identities by the same identities up 
to an error which is 0(n~ a ) for all a > 0. 



Remark 2.8. Complements on Theorem 2.3 , One can wonder how the unique accumulation 
point mentioned in Theorem 2.3 depends on the fourth moment of the entries of X n . Let 

G := (G s , t ) (s . 

,t)e[o,i] 2 be distributed according to this distribution. We know that (G Stt )( s ,t)e[o,i]- 2 
is the bivariate Brownian bridge only in the case where lim^^oo E[|xi i2 | 4 ] =4-/3. In the 
other cases, defining -F S emicircie as the cumulative distribution function of the semicircle law, the 
covariance of the centered Gaussian process 

i-2 



(5) 



u k G 



An) 



du 



se[o,i],fc>o 



can be computed thanks to Lemma 3.4 and Proposition 3.5 By Lemma 3.3, it determines 



completely the distribution of the process G. However, making the covariance of G explicit out 
of the covariance of the process of ([5]) is a very delicate problem, and we shall only stay at 
a quite vague level, saying that it can be deduced from the proof of Proposition |3.5| that the 

|4l 



variances of the one-dimensional marginals of G are increasing functions of lim™-^ E[|xi )2 



More specifically, one can deduce from Lemma 3.4 and Proposition 3.5 that for all < s\, s 2 < 1, 
/ r2 



Cov 



u 2 G 



scmicirc 



le (u)d«, 



u 2 G 



*2. 



scmicirc 



le(") 



du 



E[|xi, 2 | 



1 



min{si, s 2 } - sis 2 ). 



Remark 2.9. Comments on the hypotheses of Theorem 2.6 (1). In order to prove 
the convergence in the Skorokhod topology, we had to make several hypotheses on the atom 
distributions: absolute continuity, moments of all orders and coincidence of their 10 (on the 
diagonal) and 12 (above the diagonal) first moments with the ones of a GOE or GUE matrix. 
We needed these assumptions to control the discontinuities of the process B n . Even though 
these hypotheses might not be optimal (especially the continuity one), a bound on the tails 
of the atom distributions seems to be necessary to avoid too large variations of the process 
B n . Indeed, as illustrated by Figure [TJ for a GOE matrix (left picture), |iij,j| 2 is close to 1/n 
for all i,j with high probability, whereas when the atom distributions have not more than a 
second moment (right picture), the matrix X n looks more like a sparse matrix, and so does U n , 
which implies that for certain (i,j)'s, |wjj| 2 — l/n is not small enough. Since |itjj| 2 — 1/w is 
the jump of the process B n at (s, t) = (i/n,j/n), this could be an obstruction to the existence 
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Figure 1. Influence of the tails of the atom distributions of X n on the 

|uij|'s: Plot of the map (i/n,j/n) i — > \\uij\ 2 — l/n| for two different choices 
of atom distributions. Left: GOE matrix. Right: Wigner matrix with atom 
distribution admitting moments only up to order 2 + e for a small e. For both 
pictures, the matrices are n x n with n = 50. 



of a continuous limit for the process B n . That being said, we have hopes to prove the theorem 



under Assumption 2.5 for m = 4 instead of m = 12 (see Remark 2.11 bellow). 



Note that it follows from the previous theorem that for all < s < s' < 1 and < t < t' < 1, 
the sequence of random variables 

vV -«)(*-*) £ (l«*^l a - V«) 

v v /v ' ns<i<ns' 

nt<j<nt' 

admits a limit in distribution as n — > oo, hence is bounded in probability (in the sense of |244 
Def. 1.1]). In the same way, it follows from [20] and [2^] that the sequence n\uij\ 2 — 1 is bounded 
in probability. In the next proposition, we improve these assertions by making them uniform 
on s, s' ,t,t' and upgrading them to the I? and L 4 levels. The proof of the proposition is 
postponed to Section |4.5| 



Proposition 2.10. Suppose that Assumptions 2.1, 2.4 and 2.5 for m = 4 (resp. m = 8) are 
satisfied. Then as n 



(6) 



n\u 



oo, the sequence 
1 (resp. 



1 



^(s'-s)(t'-t) 



E (i 

ns<i<ns' 
nt<j<nt' 



h3 



1/n)) 



is bounded for the L 4 (resp. L 2 ) norm, uniformly in i,j (resp. s < s',t < t 1 ,). 



Remark 2.11. Comments on the hypotheses of Theorem 2.6 (2). This proposition is 



almost sufficient to apply the tightness criterion that we use in this paper. Wo uld the second 
term of ^ have been bounded for the L 2+e norm (instead of L 2 ), Assumption 2.5 for m = 8 
would have been enough to prove that B n converges in distribution, for the Skorokhod topology 
in -D([0, l] 2 ), to the bivariate Brownian bridge. 
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3. Proofs of Theorems 12.31 and 12.61 

3.1. Outline of the proofs. Firstly, Theorem |2 . 6| can be deduced from Theorem |2.3| by proving 
that the sequence (distribution(B n )) n >i is tight and only has C([0, l] 2 )-supported accumulation 
points. This can be done via some upper bounds on the fourth moment of the increments of 



B n and on its jumps (or discontinuities). These bounds are given in the proof of Lemma 3.1 
below, and rely on the existing bounds in the case where X n is a GOE or GUE matrix and on 
the "one-by-one entries replacement method" developed by Terence Tao and Van Vu in recent 
papers, such as [25| [26]. 

Secondly, the proof of Theorem |2.3| relies on the following remark, inspired by some ideas 
of Jack Silverstein (see Chap. 10] and [22J): even though we do not have any "direct 
access" to the eigenvectors of X n , we have access to the process (B™ F ( u \) s e[o,i],«eiR> f° r 

F »x n ( u ) '■= ^#0; X i ^ u )- Indeed, 




hence for all fixed s G [0,1], the function u G K i — > B n „ , is the cumulative distribution 
function of the signed measur^] 




which can be studied via its moments 



n 



l<i<ns 

the ei's being the vectors of the canonical basis. From the asymptotic behavior of the moments 
of the signed measure of ([7]), one can then find out the asymptotic behavior of its cumulative 
distribution function. 

Once the asymptotic distribution of the process (B™ F r u -\) s e[o,i],ueU identified, one can 

obtain the asymptotic distribution of the process (-B"t)<je[o,i],te[o,i] because the function F^ Xn 
tends to the (non random) cumulative distribution function -F se micircie of the semicircle law. 

3.2. Formal proofs. By a standard tightness + uniqueness of the accumulation point argument, 



Theorem 2.6 will follow from the following lemma and Theorem 2.3. The proof of the lemma is 



postponed to Section |4.5| 

Lemma 3.1. Under Assumptions 2.1, 2.4 and 2.5 form = 12, the sequence (distribution(i? n )) ri >i 



is C-tight, i.e. is tight and has only C([0, 1] ) -supported accumulation points 



So let us now prove Theorem |2 . 3| under Assumptions |2 . 1] and |2 . 2| Note that it suffices to prove 
that the sequence (distribution(B n )) n >i has a unique possible accumulation point supported 
by C([0, l] 2 ) and that this accumulation point is the distribution of a centered Gaussian process 
which depends on the distributions of the a^j's only through linin^oo Efjxi^l 4 ] (and actually 



2 Note that the random probability measures l M »>j'| 2 £vj have been studied in [5] (convergence to the 

semicircle law and fluctuations). Here, we are somehow considering the fluctuations of large sums of such random 
probability measures (1 < i < ns) around the empirical spectral law ~ X^j=i $>-j ■ 
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does depend on linin^oo Eflxi^l 4 ]). Indeed, in this case, by Theorem 1.1 of [TT], where the case 
of GOE and GUE matrices is treated, this limit distribution is the bivariate Brownian bridge if 
and only if limn^oo IE[|xi ; 2 1 4 ] is the same as for X n a GOE or GUE matrix, i.e. equal to 4 — /3. 

The following proposition is the key of the proof, since it allows to transfer the problem from 
the eigenvectors to some more accessible objects: the weighted spectral distributions of X n . 



Proposition 3.2. Let ej denote the ith vector of the canonical basis. To prove Theorem 2.3, it 
suffices to prove that each finite- dimentional marginal distribution of the process 



se[o,i],fc>i 

converges to a centered Gaussian measure and that the covariance of the limit process depends 
on the distributions of the Xij's only through lim n _ ) . 00 Efjxi^l 4 ] and actually does depend on 
Hindoo E[| 0:1,2 1 4 ]. 

Proof. Let [ix n '■= \ J2i=i be the empirical spectral law of X n and F )lx (u) := ^#{« ; Aj < 
u} be its cumulative distribution function. It is a well known result [31 Th. 2.5] that F^ Xn 
converges in probability, as n tends to infinity, to the cumulative distribution function ^semicircle 



of the semicircle law in the space D C ($L, [0, 1]) (see Section 4.1 for the definitions of the functional 
spaces and their topologies). Moreover, the map 

A)([0,1] 2 ) x £> C (K,[0,1]) — >• D c ([0,l]xR) 

,u)e[o,i]xt 

is continuous at any pair of continuous functions. Hence for any continuous process (-Bs,t)s,te[o,il 
whose distribution is an accumulation point of the sequence (distribution(B n )) n >i for the Sko- 
rokhod topology in D([0, l] 2 ), the process 

( B s,F MXn («))(s,«)e[0,l]xR 
converges in distribution (up to the extraction of a subsequence) to the process 

( B S,F scmicilclc (u)){s,u)e[0,l]xR 

(for the reader who is not used to weak convergence of probability measures, this assertion relies 
on two results which can be found in [6]: Theorem 4.4 and Corollary 1 of Theorem 5.1 in Chapter 
1). Now, note that ^semicircle '■ R — > [0, 1] admits a right inverse, so the distribution of the process 
(fi s ,t) S) te[o,i] is entirely determined by the one of the process (-B s ,F scmicirclc («))(s,«)e[o,i]xR- 
As a consequence, to prove Theorem |2.3[ it suffices to prove that the sequence 

n>l 



( 8 ) (distribution(( J B^ F {u) ){ s ,u)e[o,i] 



has a unique possible C c ([0, 1] x M)-supported accumulation point as n — > oo. 

Now, note that any / G C c ([0, 1] x M.) is entirely determined by the collection of real numbers 
(J ueK u k f(s, ^)dn) sg [ 0i i] i fc>o- Let us prove that this fact remains true in the "distribution sense" 
in the case where / is random. More precisely, let us prove the following lemma. 

Lemma 3.3. Let f be a random variable taking values in C c ([0, 1] xl) such that with probability 
one, f(s,u) = when \u\ > 2. Then the distribution of f is entirely determined by the finite 
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dimensional marginals of the process 



(9) / u k f(s,u)du 

\JueR j se[o,i],fe>o 

Moreover, in the case where the finite dimensional marginals of the process of ^ are Gaussian 
and centered, then so are the ones of f. 

Proof. Let us fix (s, no) £ [0,1] x [—2,2] and let, for each p > 1, (P p ,q) q >i be a sequence of 
polynomials that is uniformly bounded on [—3,3] and that converges pointwise to t[ UOjUo +i/ p ] 
on [—3,3]. Then one has, with probability one, 

ru + l/p r 

f(s,uo)= limp / f(s,u)du= lim limp / P p q (u)f(s,u)du. 

Ju p^ooq^oo J um ' 

This proves the lemma, because any almost sure limit of a sequence of variables belonging to a 
space of centered Gaussian variables is Gaussian and centered. □ 

Since the fourth moment of the entries of X n is bounded, by e.g. [HJ Th. 5.1], we know that 
the extreme eigenvalues of X n converge to —2 and 2. As a consequence, for any random variable 
/ taking values in C c ([0, 1] x R) such that the distribution of / is a limit point of the sequence 
of Q, we know that with probability one, f(s, u) = when \u\ > 2. 

As a consequence, it follows from the previous lemma and from what precedes that to prove 



Theorem 2.3, it suffices to prove that each finite dimensional marginal distribution of the process 



u*b: jF (u) d U 

converges to a centered Gaussian measure and that the covariance of the limit process depends 
on the distributions of the Xij's only through lim n ^.oo Ef^i^l 4 ] and actually does depend on 
lim^ooEflxi^l 4 ]. 

But for all (s, u) G [0, 1] x R, 

^W«) = VI £ £ (K;l 2 -Vn), 



hence 

( 10 ) B ",F W («) = \/ 9 ^ F HX n , H -HX n ( U )> 



1< i<ns 



where: 



fix n is still the empirical spectral law of X n , 

Hx n ,ei is the weighted spectral law of X n , defined by fix n , ei ■= Y%=i K,il 2 ^Aj> 

Ffj, x e _ -[i Xn is the cumulative distribution function of the null-mass signed measure 



The following lemma will allow to conclude the proof of Proposition 3.2 

Lemma 3.4. Let jj, be a compactly supported null-mass signed measure and set F^{u) :- 
fi({—oo,u\). Then for all k > , 

vf'F^du = - / — — dfi(x). 



fc + 1 
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Proof. Let a < b be such that the support of fx is contained in the open interval (a, b). is 
null out of (a, b) and satisfies Fu(u) = —//((it, b)), so by Fubini's Theorem, 

f f b f x f b —x k+1 a k+l f b —x k+1 

u k F^{u)du = -u k dudfi{x) = ——dfi(x) + -— f i((a,b))= — — -dp(x). 

JueR Jx=a Ju=a J x=a K T 1 K -\- 1 J x=a K T 1 

□ 



It follows from this lemma and from (10) that for all s € [0, 1], k > 0, 



which proves Proposition 3.2 □ 



Now, Theorem |2.3| is a direct consequence of the following proposition, whose proof is post- 



poned below to Section 4.2 



Proposition 3.5. Under Assumptions 2.1 and 2.2, each finite- dimentional marginal distribution 
of the process 



(e*X k e t -lTr(X k ))\ 

^l<i<ns J 



se[o,i],fc>i 

converges to a centered Gaussian measure. The covariance of the limit distribution, denoted by 

(CoV Sl]S2 (fci, ^2)) SliS2e [ 0i i] >fcl)A , 2 >i , 

depends on the distributions of the Xij's only through lim n _ > . 0O E[|a;i2| ]• Moreover, we have 
Cov gljS2 (2,2) = ( lim E[|xi j2 | 4 ] - l)(min{si, s 2 } - s 1 s 2 ). 

n— >oo 

4. Proofs of the technical results 

4.1. Functional spaces and associated topologies. In this paper, we use several functional 
spaces: 

• C([0, l] 2 ) (resp. C c ([0, 1] x M)) is the set of continuous functions on [0, l] 2 (resp. com- 
pactly supported continuous function on [0, ljxffi), endowed with the uniform convergence 
topology. 

• D C (M, [0, 1]) is the set of compactly supported cadlag functions on R taking values in 
[0, 1], endowed with the topology defined by the fact that /„ — > f if and only if the 
bounds of the support of /„ tend to the ones of the support of / and for all M > 0, after 
restriction to [— M, M], f n — > f (the topology of D([—M,M]) being deduced from the 
one of D([0, 1]) defined in [6J). 

• D([0, l] 2 ) (resp. D c ([0, 1] x R)) is the set of functions / : [0, l] 2 — > R (resp. of compactly 
supported functions / : [0, 1] x R — > R) admitting limits in all "orthants", more precisely 
such that for each so,to, for each pair of symbols o,o' £ {<, >}, 

lim f(s,t) 

sos J y ' ' 
to'to 

exists, and is equal to /(so>*o) if both o and o' are >. The space D([Q, l] 2 ) is endowed 
with the Skorokhod topology defined in [5] and the space D c ([0, 1] x R) is endowed 
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with the topology defined by f n — > f if and only if for all M > 0, after restriction to 
[0, 1] x [-M, M], f n — > f (the topology of D([0, 1] x [-M, M}) being deduced from the 
one of £>([0,1] 2 )). 

• -Do([0, l] 2 ) is the set of functions in D([0, l] 2 ) vanishing at the border of [0, l] 2 , endowed 
with the induced topology. 



4.2. Proof of Proposition 3.5 , Note that by invariance of the law of X n under conjugation 
by any permutation matrix, the expectation of the random law nx„,ei does not depend on i. So 
for all s G [0,1], k > 1, 
(11) 

]T (epf** - \ Tv(X k n )) = {<*fe - ne*X k n e t ]) - M £ (e * X * e . _ E[e*X* ei )). 



Ki<ns 



Ki<ns 



n 



Ki<n 



Hence we are led to study the limit, as n — > oo, of the finite-dimentional marginal distributions 
of the process 



(12) 



(e*X^-E[e*X*e 4 ]) 



1< i<ns 



sS[0,l],fc>l 



The random variables of (12) are going to be studied via their joint moments. So let us fix 
P > 1) si, . . . , s p £ [0, 1] and fei, . . . , k p > 1. We shall study the limit, as n tends to infinity, of 



(13) 



We introduce the set 



nil E (e*X^e t -E[e*X^ ei ])]. 

i=l l<i<ns£ 



(14) 



£ :={0\. 



M }U 



u {o p , . 



) ™p Si 



where the sets {0,1,.. .}, {0 , 1 ,...}, , {0 P , l P , . . .} are p disjoint copies of the set of 

nonnegative integers. The set £ is ordered as presented in ( |14| ). For each partition tt of £ , for 
each x £ £, we denote by ir(x) the index of the class of x, after having ordered the classes 
according to the order of their first element (for example, tt(1 ) = 1; 7r(2 ) = 1 if 1 ~ 2 and 
vr^ 1 ) = 2 if T 1 & 2 1 ). 



By Assumption 2.1, the expectation of (13) can be expanded and expressed as a sum on the 



set Part(£) of partitions of the set £ introduced above. We get 

(15) E[n? = i T,i<i<nsM X nei ~ ne*Xtei})} 



k-t H hfcp 



n 



£ A(n, vr)E 

7rePart(£) 



nLi (^ov^ 



5 r fi-t. — E[x ,jrf. , T £. • • • X .■= ,-jpt,. 

Tr(k e -l ),5T(fc/ ) L tt(0 ),7r(l ) 7r(fc^-l ),7r(fcf )- 



where for each tt E Part (5), A(n,ir) is the number of families of indices of 



whose level sets partition is tt and who satisfy, for each £ = 1, . . . ,p, 



(16) 



1 < i=e = L-t < nsc. 

— fcf - 
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For any ir £ Part (5), let us define G n to be the graph with vertex set the set {tt(x) ; x G £ } 
and edge set 



E v := { {7r(m- V),ir(fnr)} ; 1 < £ < p, m G {1, . . . , k e } }. 



For the term associated to tt in (15) to be non zero, we need to have: 



(i) 7r(0 ) = ix(k.£ ) for each £ = 1, . . . ,p, 

_£ £ 

(h) each edge of G w is visited at least twice by the union of the p paths (ir(0 ), , ixikg )) 

(£ = l,...,p), 

(iii) for each £ = 1, . . . ,p, the exists £' ^ £ such that at least one edge of G n is visited by 
both paths (ir(0 ), ,ft(k£ )) and (7r(0 ), ,Tt{ke> )). 



Indeed, (i) is due to (16), (ii) is due to the fact that the Xi j s tire centered and (iii) is due to 
the fact the the independent. 

Let us define a function s(-) on the set £ in the following way: for each £ = 1, . . . ,p and each 
m = 0, . . . , ke, set 

J sg if m = or m = ke, 
I 1 otherwise, 



s(m l ) 



and 

(17) s n := II mins(x). 



B bloc of TT 



Then one can easily see that, as n — > oo, 

(18) A(n,7r)~s w n w . 



Thus for 7r to have a non zero asymptotic contribution to (15), we need the following condition, 
in addition to (i), (ii) and (iii): 

( iv ) fci+-+fc P < (7r |. 

The following lemma is a generalization of (U Lem. 2.1.34]. Its proof goes along the same 
lines as the proof of the former. 

Lemma 4.1. Let ir G Part(£) satisfy (i), (ii), (iii). The the number c n of connected components 
of G n is such that cv < p/2 and 

p ki H h k p 

kl<c.-- + • 

As a consequence, if tt also satisfies (iv), we have 

(a) c n = p/2, 

(b) p is even, 

(c) | vi" | = fcl+ 2 +fcp ( so that k± -\ + k p is also even). 

Note also that by (ii), we also have 

(d) \E n \ < fcl+ ' 2 + V 
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(19) 



To sum up, by (15) and (18), we have 

lim^oo E[nLl El<i<nsM X n^ ~ 



s n linin-Kxj E n?=i [ x n ^ ,-i,---x-, — T e, — <, - E\x ,-, , -•.■■■■>■ 



t(0'),^(1') 



7v(k£-l ),TT(k t ) 



7r(0'),7r(r) 



n(k£— 1 ),7r(fe^ )• 



where the sum is taken over the partitions it of £ which satisfy (i), (ii), (iii) and (iv) above, and 
such partitions also do satisfy (a), (b), (c) and (d) above. 

Case where p is odd: By (b), we know that when p is odd, there is no partition tt satisfying 
the above conditions. So by (19), 

p 

hmEfQ (e*X^e i -E[e*X^e l ])]=0. 

1=1 l<i<ns e 



Case where p = 2: In this case, by (a), for each partition tt satisfying (i), (ii), (iii) and (iv) 
G n is connected, so that | vr | — 1 < l-E^I- Hence by (c) and (d), \E W \ is either equal to 



kl+k2 



or 



to 



ki+k 2 



1. 



In the case where I.EU 



1, the graph has exactly one more vertex than edges, 



ki+k 2 
2 

—1 — 1 —2 — 2 

hence is a tree. As a consequence, the paths (ir(0 ), , 7C(ki )) and (ir(0 ), ,7r(£;2 ))j 

which have the same beginning and ending vertices, have the property to visit an even number 
of times each edge they visit. By an obvious cardinality argument, only one edge is visited more 
than twice, and it is visited four times (twice in each sense). The other edges are visited once 
in each sense. It follows that the expectation associated to tt in (pb is equal to K[|xi >2 | 4 ] - 1. 

In the case where l-E^I = kl ~^ k2 , by a cardinality argument again, we see that each edge of 
is visited exactly twice (in fact, the configuration is the one described in [I] Sect. 2.1.7], where 



is a bracelet). It follows that the expectation associated to tx in (19) is equal to 1. 



As a consequence, as n tends to infinity, 



E 



Y, (e*Xte t -E[e*Xte l ])x £ (e*A n fe2 e 4 - E[e* X^e t ]) 

l<i<nsi l<i<ns2 



converges to a number that we shall denote by 



(20) 



Cov 



centered 

S1,S2 



and which depends on the distributions of the Xij's only through limn-^oo Efjxi^l 4 ]. 



Case where p is > 2 and even: By (a) above, for each partition tt satisfying (i), (ii), (iii) and 
(iv), G n has exactly p/2 connected components. By (iii), each of them contains the support of 
exactly two of the p paths 

(21) (vr(o'), ,tt(Y/)) (£=l,...,p). 

Let us define a n to be the matching (i.e. a permutation all of whose cycles have length two) of 
{1, . . . ,p} such that for all £ = 1, . . . ,p, the paths with indices £ and a n (£) of the collection ( J2l| ) 
above are supported by the same connected component of G n . 
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We shall now partition the sum of ( 19 ) according to the value of the matching a w defined by 
it. We get 

f (e* i X%e i -E[e*Xfre i })} = 
E\x 



lim n ^ooE[nf =1 E 



l<i<nsi ' 



E CT Ett s n lim„- 



E 



X fn e \ rT e \ ' ' ' X n T e \ IT- e \ ~ ^[ X frA IT t \ ' ' ' X n iA /T~ e \ 

7r(0 ),7r(l ) ir(k e -l ),n(k e ) tt(0 ),tt(1 ) n{k t -l ),n(k e )■ 



where the first sum is over the matchings a of {1, . . . , p} and the second sum is over the partitions 
it satisfying (i), (ii), (iii) and (iv) such that a n = a. 

Note that for each matching a of { 1 , . . . , p} , the set of partitions it of £ such that a n = a can 
be identified with the cartesian product, indexed by the set of cycles {£,£'} of a, of the set of 
partitions r of the set 



£e,£' := {0 , 

satisfying the following conditions 



} u {o l 



} (subset of £ ) 



(i.2) r((f ) = r(fc/) and r((f ) = r(k/ ), 

(ii.2) each edge of the graph G T is visited at least twice by the union of the 2 paths (t(0 ), , r(kg )) 

and (t(0 ), ,r(k e )), 

(iii. 2) at least one edge of G T is visited by both previous paths, 
( iv .2) k -i+hL < | T |. 

Moreover, by independence of the random variables Xij's, the expectation 

p 



E 



n 



=i 



V)^(iV" x ^-i'),^/) 



E[x 



AO ),7r(U) 



7r(A^-l ),-ir(k e )■ 



factorizes along the connected components of G n . The factor s w , defined in (17), also factorizes 
along the connected components of G^. It follows that we have 



lim^ E[IlLi Ei< i<ns >*^ " n^X^et])] 



E CT U{e,£'} lim n 



E 



E 



Ki<ns 



,(e*X^ ei -E[e*X^])x£ 



l<i<ns e . 



i (e*Xn l 'e i -E[e*Xn e 'e l }) 



where the sum is over the matchings a of {1, . . . ,p} and for each such a, the product is over the 
cycles {£, £'} of a. 

By the definition of Cov^ 2 ered (A;i, k 2 ) in (|oJ), we get 



limEfQ ]T (e*Xte i -E[e*X%e i })]= £ 



n 



oovs°'r d (*'>*<' 



£=1 l<j<n S( , 



a matching {£,£'} cycle of a 



By Wick's formula and Equation (11), we have proved the first part of Proposition 3.5 



Computation of Cov sljS2 (2, 2): We have, by the paragraph devoted to the case p = 2 above, 
Cov— d (2,2)= ^xOim E[|x li2 | 4 ]-1) + £ 

7r, Gtt is a tree it, is a bracelet 

There is exactly one tree and zero bracelet with 2 vertices. We represent this tree in the following 
way: • — o. There are two associated partitions it: 
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- the first one is denned by 

MSVClVCS 1 )) = K0 2 ),vr(T 2 ),42 2 )) = (.,o,.), 

and satisfies s n = min{si, s 2 }, 

- the second one is defined by 

(jrCUVClV^ 1 )) = (•,«,•) , (vr(0 2 ),vr(T 2 ),7r(2 2 )) = (o,.,o), 
and satisfies = sis 2 . 

As a consequence, Cov^* 2 ered ( 2 > 2) = (lim^oo E[|xi j2 | 4 ] - l)(min{si, s 2 } + sis 2 ). 

From this formula and Equation ( |11[ ), we easily deduce the formula of Cov Sl)S2 (2, 2). It 
concludes the proof of Proposition |3.5| □ 

4.3. A preliminary result for the proofs of Proposition 2.10 and Lemma 3.1[ Propo- 
sition |4.3| below will allow to prove Lemma |3.1| Let us now consider two independent Wigner 



matrices X n , X' n . Let us introduce a modified version of Assumption 2.5, also depending on an 
integer m > 2: 

Assumption 4.2. For each k > 1, sup n E[|xi ) i| fc + |xi )2 | fc + \x'i i\ k + \%i 2I ] < 00 ■ Moreover, 
there exists £q > such that for each r,s > 0, 

(22) r + s < m - 2 => E[K(xi,i) r 9(xi,i) s ] - E[K(x , 1>1 ) r 9(x / ljl ) s ] = O(n- eo+1+!± ^ m ) 
and 

(23) r + s<m =► E[K(xi, 2 ) r Q(xi, 2 ) s ] - E[3?(x' 1 2 ) r %{x' 1 2 ) s ] = O{n~ eo+r± ^). 



Let U' n = [u'ij]fj = i be the associated eigenvectors matrix of a matrix X' n (like £/ n = |//, 



in 

for XA 



Proposition 4. 3. W ^e suppose that X n , X' n both satisfy Assumptions \2J\ and 2.4-, and sat- 
isfy Assumption Let us fix a positive integer k and a polynomial function G on C k . 
For each n, let us consider a collection (h,pi, qi), ■ ■ ■ , (ik)Pk-, %) G {L---) n } 3 (this collec- 
tion might depend on n). Then for a certain constant C independent of n and of the choices of 
(h,pi, qi), . . . , (ik,Pk, qk) (but depending on k and on G) 

(24) I E[G(nu PlM u quil , . . . ,nu Pk , lk u qk>ik )] -E[G(nu puh u quil , . . . ,nu pk)lk u Qkylk )] \ <Cn 2 -^. 

Proof. The proof follows the ideas developed by Terence Tao and Van Vu in recent papers, 
such as \25\ [26] . More precisely, it makes an intensive use of the strategy and of some estimations 
given in |26j. 

An event E = E^ depending on the parameter n will be said to be true with overwhelming 
probability if for all c > 0, 1 — ¥(E) = 0(n~ c ). One can neglect any such event in the proof of 
(24) because G has polynomial growth and the entries of unitary matrices are bounded by 1. 



By Assumption |2.5| and Chebyshev's inequality, that for each k, as t — > 00, 



\ x i,j\ >t) = 0(t k ) (uniformly in i,j and n), 



so that by the union bound, P(maxi<jj< n \xij\ > n e ) = 0(n 2 ~ ke ) for any e, k. Hence by what 
precedes, for a fixed E\ (a parameter fixed later), one can suppose that for each n, the Xjj's 
satisfy |xy| < n £l for all Of course, one can also suppose that |x£ -| < n Sl for all i,j. 
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To prove (24), it suffices to prove that we can replace the Xij's by the x l i j's one by one up 
to an error in the considered expectation which is 0(n l ~ t) for each diagonal replacement and 
0(n~~2~) for each off-diagonal replacement (and, of course, that these bounds on the errors are 
uniform in the n ( n 2 +1 ) replacements). 

So let us fix 1 < p < q < n and define, for each z £ C (or z G M if p = q), A Piq (z) be the 
matrix obtained by "mixing" the (rescaled) matrices X n and X' n in the following way: 

• for each 1 < i < j < n such that is below (p, q) for the lexicographic order, the i,j 
entry of A Pt „(z) and its symmetric one are the corresponding ones in the entries of nX' n , 

• the p, q entry of Ap <q (z) and its symmetric one are z and z, 

• for each 1 < i < j < n such that is above (p, q) for the lexicographic order, the i,j 
entry of A Ptq (z) and its symmetric one are the corresponding ones in the entries of nX n . 

We set U(z) = [ui,j(z)]ij = i to be the eigenvectors matrix of A p ^ q (z) and 

F(z) := G(nu piAl (z)u quh (z), . . . ,nu PkAk (z)u qktik (z)). 

Let E[ • | j4p jg (0)] denote the conditional expectation with respect to the u-algebra generated by 
the entries of A Ptq (0). We have to prove that with overwhelming probability, we have, uniformly 
on the probability space and on p,q, 

(25) E[F(^x p , q ) | A m (0)} = E[F{^x' m ) \ A p>q (0)\ + 0(n^-f ). 

To prove it, we shall use the Taylor expansion of F around zero. Let us for example treat the 
case p ^ q. For \x\ < n 61 , we have 



/ 

F(V^x)= Y, ^ r ^ (2) a| (2) F(0)S(x) s 9(x) s +O 



n 2 



r\s\ 

r+s<m 



n (m+l)( £l +l/2) gup \d^ (z) dL z) F( Z ) 
r+s=m+l 

\ \z\<n E i +1 / 2 



Moreover, by [26, Propo. 20], we know that with overwhelming probability, A Ptq (0) is a good 
configuration (in the sense of [26, Def. 18]). By Equation (27) of [2^1 Lem. 24], it implies that 
with overwhelming probability, for all I = 1, . . . , k, for all r, s such that r + s < 10, 

™p |$ W ^ W (nW*)*W*)|) = 0(» El_(r+s) ) 

| z |< n l/2+ ei 

(in the statement of \26\ Lem. 24], m must be bounded above by 10, but the bound 10 can be 
replaced by any finite one). The function G is polynomial, let d be its degree. It follows that 
with overwhelming probability, for all r,s, we have 

^p i^vwi^^)- 

|^|< n l/2+ £1 

As a consequence, by the bounds of p3] ), 
E[F(^lx p , q ) | A m (0)] - E[F(V^ M ) | A p>q (0)] = ^ 0(n^~f " £0 ) + O^ 1 ^-^) 

r+s<m 



Now, choosing e\ small enough so that de\ < Eq and (m + 1 + d)e\ < 1/2, we get (25). □ 
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4.4. Proof of Proposition 2.10, Let us first prove that E[(n|u 



,., — l) 4 ] is bounded inde- 
pendently of i,j and n. To do so, let us apply Proposition |4.3| for X' n a GOE or GUE matrix 
and m = 4: it allows to reduce the problem to the case where the 

ti^, j S £11*6 the entries of a 

Haar-distributed matrix. In this case, the distribution of Uij does not depend on i,j and it 
is well known that the moments of \fnu\ i converge to the ones of a standard real or complex 
Gaussian variable. We did not find any concise enough reference for this moments convergence, 
but in the unitary case, Proposition 3.4 of [4] allows to compute E[(n| , ui j i| 2 — l) 4 ] and to verify 
its boundedness. To treat the orthogonal case, the most direct way to compute E[(n|ui i i| 2 — l) 4 ] 
is to use the fact that u\ 1 has the same distribution as Z 2 /(Z 2 + • • • + Z 2 ) for Zi independent 
standard Gaussian variables (see e.g. pjU Lem. 2.1]) and then to use [HI Lem. 2.4]. 

Let us now prove that there is a constant C independent of n and of < s < s' < 1 and 
< t < tf < 1 such that 



(26) 



m E E (M 2 -i/™)} 2 ]<c( s '- s )(t'-t). 

ns<i<ns' nt<j<nt' 



Set / = {i = 1, . . . , n ; ns < i < ns'} and J = {j = 1, . . . , n; nt < j < nt'}. We have 



E[{ e (l« 

ielJeJ 



l/n)f 



n 



n\u, 



(^J»fcl,26(/Xj) 2 1= 



1,2 



Let us now apply Proposition 



4.3 



for XL a GOE or GUE matrix and m = 4. Let UL 



be a Haar-distributed orthogonal or unitary matrix. By Proposition |4.3[ we have 
2 — L i n ) i \ — 1L \w 1 * > > t n j 

=1,2 



e[{ E (I* 

i£l,j<=J 



l/n)} 2 ]=n- 2 0(n- 2 (#I x J) 2 )+n- 2 E E t II 

{k,je)e=i,2e(lx J) 2 



2 -1)1, 



where the term n 2 0(n 2 (#/ x J) 2 ) is uniform in s,s',t,t', hence is bounded, for a certain 
constant Ci, by Ci(s' - s) 2 (t' - t) 2 < Ci(s' - s)(t' - t). So it suffices to prove the result for U' n 
instead of U n . Since each \u[j\ 2 has expectation 1/n, 



>2 \u' • I 



1. 



Then, to compute E[|u^ jj 2 !^' 



E [IlH<iJ 2 -i)] = « 2 E[K >il l 

2 ] , one uses the Weingarten calculus, developed in [3 [TU] . By 



12 ,32 1 



the formulas of [10|. Sec. 6] for the orthogonal group and (9J Sec. 5.2] for the unitary group, 

n + l ,rr nrtho 1 



Wg OTtllo (/d {1)2} ) 
Wg unit (/d {1>2} ) = 



n(n- l)(n + 2)' 
1 



77" 



1 



Wg ortho ((12)) 



Wg umt ((12)) 



n(n- l)(n + 2)' 
-1 



n(n 2 



1) 



(where (12) denotes the transposition with support {1,2}), one can easily verify that 



n 2 E[\u' ilJl \ 2 \u , i2< 



.12 



0(n~ 2 ) if i x ^ i 2 , ii ^ j 2 , 
C^n" 1 ) if (ii = z 2 , ji / j 2 ) or (n ^ i 2l j a 
s O(l) if i x = i 2 , ii =J2, 



J2j 



which is enough to deduce (26). 
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4.5. Proof of Lemma 3.1] To prove Lemma 3.1, we shall use the following proposition, which 
is the obvious multidimentional generalization of Proposition 3.26 of Chapter VI of [17] (note 
also that the result of [T7] is devoted to pocesses on [0,oo) and not on [0, 1], so that the N of 
|17j means nothing to us). 

For / € D([0, l] 2 ) and (so,to) £ [0>1] 2 > we define A SQjto f to be the maximal jump of / at 
(s , to), i-e. 



A SoA) / := max 

o,o'e{<,>} 



/(s , to) - hm/(s,t) 

to' t 



Proposition 4.4. If the sequence (distribution(l? n )) n >i is tight and satisfies 

(27) Ve > 0, P( sup A s t B n > e) — > 0, 

(s,t)e[o,i] 2 n ^°° 

then the sequence (distribution(l? n )) n >i is C-tight, i.e. is tight and can only have C([0, l] 2 ) 
supported accumulation points. 



So to prove Lemma 3.1 let us first prove that the sequence {distribution(B n )) n >i is tight. 
Note that the process B n vanishes at the border of [0, l] 2 . So according to Th. 3] and 
to Cauchy-Schwartz inequality, it suffices to prove that there is C < oo such that for n large 
enough, for all s < s', t < t' G [0, 1], 

m E E (l^-| 2 -l/n)} 4 ]<C7( S '- S ) 2 (t'-t) 2 . 

ns<i<ns' nt<j<nt' 



As in the proof of (26) above, one can suppose that the the entries of a Haar-distributed 

matrix. But in this case, the job has already been done in [11] : the unitary case is treated 
in Section 3.4.1 (see specifically Equation (3.25)) and the orthogonal case is treated, more 
elliptically, in Section 4.5 (to recover the details of the proof, join Equations (3.26), (4.5) and 
(4.17)). 

|2 



Let us now prove (27). Note that sup( s i ) g [ 01 ]2 A Sjt i? n = maxi< 



i,j<n | l^ij 



l/n|. As a 



consequence, by the union bound, it suffices to prove that for each e > 0, there is C < oo 
independent of i,j and n such that for all i, j, 

iij\ 2 - l/n\ > e) < Cn~ 4 , 

□ 



which follows from Chebyshev's inequality and Proposition 2.10 
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